IN THE SPECIFICATION

Please replace the paragraph beginning at page 2, line 18, with the following rewritten
paragraph: 

-- Figure 1 (SEQ ID NOS:2, 5-8) depicts a homology lineup between the GFPs of Renilla muelleri
(SEQ ID NO:2), Ptilosarcus gurneyi (SEQ ID NO:5), Aequorea (SEQ ID NO:7) and its enhanced
version, EGFP (SEQ ID NO:6). The underlined residues are the fluorescent tripeptide
(chromophore). Identity, strong similarity and weak similarity are depicted. --



Please replace the paragraph beginning at page 2, line 21, with the following rewritten
paragraph: 

- Figure 2 depicts the nucleic acid sequence (SEQ ID NO: 1) of the wild type rGFP (SEQ ID 
NO:2). - 




Please replace the paragraph beginning at page 2, line 22, with the following rewritten 
paragraph: 



-- Figure 3 depicts the nucleic acid sequence (SEQ ID NOS:3, 4) of the wild type pGFP (SEQ
ID NO:5). --



Please replace the paragraph beginning at page 2, line 24, with the following rewritten
paragraph: 




- The present invention is directed to the use of Renilla green fluorescent protein (hereinafter 
"rGFP"), in a variety of methods and compositions that exploit the autofluorescent properties 
of rGFP. These methods include, but are not limited to, the use of rGFP as a reporter molecule 
in cell screening assays, including intracellular assays; the use of rGFP as a scaffold protein 
for fusions with random peptide libraries; etc. Similarly, compositions of rGFP are provided, 
including constructs of rGFP such as fusion constructs that include rGFP as a reporter gene, 
retroviral constructs including rGFP and internal ribosome entry sites (IRES), etc. Basically, 
the invention provides a number of novel uses for rGFP, similar to those outlined for aGFP 
in WO 95/07463, hereby incorporated by reference in its entirety. In addition, the invention
is also directed to the use of Ptilosarcus gurneyi green fluorescent protein ("pGFP"), the amino
acid sequence of which is shown in Figure 1 (SEQ ID NO:5) and is also depicted in WO
99/49019. It should be noted that while the discussion below is directed to rGFP, pGFP may
be used as well. --



Please replace the paragraph beginning at page 3, line 10, with the following rewritten
paragraph: 



-- In a preferred embodiment, the invention provides compositions including rGFP. By "Renilla
green fluorescent protein" or "rGFP" herein is meant a protein that has significant homology, 
as defined herein, to the wild-type protein of Figure 1 (SEQ ID NO:2), as depicted in WO 
99/49019, hereby incorporated by reference in its entirety. - 



Please replace the paragraph beginning at page 3, line 14, with the following rewritten
paragraph: 





-- In a preferred embodiment, the invention provides compositions including pGFP. By
"Ptilosarcus green fluorescent protein" or "pGFP" herein is meant a protein that has significant
homology, as defined herein, to the wild-type protein of Figure 3 (SEQ ID NO:5), as depicted 
in WO 99/49019, hereby incorporated by reference in its entirety. - 



Please replace the paragraph beginning at page 3, line 18, with the following rewritten
paragraph: 



-- An rGFP or pGFP protein of the present invention may be identified in several ways. "Protein"
in this sense includes proteins, polypeptides, and peptides. An r- or pGFP nucleic acid or protein
is initially identified by substantial nucleic acid and/or amino acid sequence homology to the 
sequences shown in Figures 1, 2 and 3 (SEQ ID NOS:1-8). Such homology can be based 
upon the overall nucleic acid or amino acid sequence. - 



Please replace the paragraph beginning at page 3, line 23, with the following rewritten
paragraph: 
-- As used herein, a protein is an "rGFP protein" or "pGFP" if the overall homology of the protein
sequence to the amino acid sequence shown in Figures 2 or 3 (SEQ ID NOS:2, 5) is preferably
greater than about 75%, more preferably greater than about 80%, even more preferably greater
than about 85% and most preferably greater than 90%. In some embodiments the homology 
will be as high as about 93 to 95 or 98%. - 



Please replace the paragraph beginning at page 5, line 5, with the following rewritten 
paragraph: 




- The alignment may include the introduction of gaps in the sequences to be aligned. In 
addition, for sequences which contain either more or fewer amino acids than the protein 
sequences shown in Figure 1 (SEQ ID NOS:2, 5-8), it is understood that the percentage of 
homology will be determined based on the number of homologous amino acids in relation to 
the total number of amino acids. Thus, for example, homology of sequences shorter than that 
shown in Figure 1 (SEQ ID NOS:2, 5-8), as discussed below, will be determined using the 
number of amino acids in the shorter sequence. — 
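
The denominator rule stated above can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned (gaps written as "-") and it counts exact identities only; the specification also speaks of strong and weak similarity, which this sketch does not score. The function name and the gap character are illustrative choices, not taken from the specification.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences.

    The denominator is the residue count of the shorter (ungapped)
    sequence, following the rule that shorter sequences are scored
    against their own length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    # Count columns where both sequences carry the same residue (not a gap).
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != gap
    )

    # Ungapped lengths of each sequence; the shorter one sets the denominator.
    len_a = len(aligned_a.replace(gap, ""))
    len_b = len(aligned_b.replace(gap, ""))
    shorter = min(len_a, len_b)
    if shorter == 0:
        raise ValueError("sequences must contain at least one residue")

    return 100.0 * matches / shorter
```

Under this sketch a fragment that matches the reference at every one of its own residues scores 100%, regardless of how much longer the reference is.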



Please replace the paragraph beginning at page 5, line 11, with the following rewritten
paragraph: 




-- GFP proteins of the present invention may be shorter or longer than the amino acid sequences
shown in Figure 1 (SEQ ID NOS:2, 5-8). Thus, in a preferred embodiment, included within 
the definition of GFP proteins are portions or fragments of the sequences depicted herein. 
Portions or fragments of r- and pGFP proteins are considered GFP proteins if a) they share 
at least one antigenic epitope; or b) have at least the indicated homology; c) preferably have 
GFP biological activity, e.g., including, but not limited to, autofluorescence; or d) fold into a 
stable structure that is similar to the wild-type structure. - 



Please replace the paragraph beginning at page 13, line 5, with the following rewritten
paragraph: 
MGC AALESEVSALESEVASj^SEVML GRGDMP LAAVKSjaSAVKSKLASVKSKLAA CGPP
(SEQ ID NO:9). The underlined regions represent a coiled-coil leucine zipper region defined
previously (see Martin et al., EMBO J. 13(22):5303-5309 (1994), incorporated by reference).
The bolded GRGDMP (SEQ ID NO:10) region represents the loop structure and when
appropriately replaced with randomized peptides (i.e. peptides, generally depicted herein as
(X)n, where X is an amino acid residue and n is an integer of at least 5 or 6) can be of variable
length. The replacement of the bolded region is facilitated by encoding restriction endonuclease
sites in the underlined regions, which allows the direct incorporation of randomized
oligonucleotides at these positions. For example, a preferred embodiment generates a XhoI
site at the double underlined LE site and a HindIII site at the double-underlined KL site. --



Please replace the paragraph beginning at page 13, line 23, with the following 
rewritten paragraph: 

— A preferred minibody presentation structure is as follows: 
MGRNSQATSGFTFSHFYMEWVRGGEYIAASR HKHNKY TTEYSASVKGRYIVSRDTSQS 
ILYLQKKKGPP (SEQ ID NO:11). The bold, underline regions are the regions which may be
randomized. The italicized phenylalanine must be invariant in the first randomizing region. 
The entire peptide is cloned in a three-oligonucleotide variation of the coiled-coil embodiment, 
thus allowing two different randomizing regions to be incorporated simultaneously. This 
embodiment utilizes non-palindromic BstXI sites on the termini - 



Please replace the paragraph beginning at page 14, line 6, with the following rewritten
paragraph:




-- In a preferred embodiment, the presentation sequence confers the ability to bind metal ions
to confer secondary structure. Thus, for example, C2H2 zinc finger sequences are used; C2H2
sequences have two cysteines and two histidines placed such that a zinc ion is chelated. Zinc
finger domains are known to occur independently in multiple zinc-finger peptides to form
structurally independent, flexibly linked domains. See J. Mol. Biol. 228:619 (1992). A general
consensus sequence is (5 amino acids)-C-(2 to 3 amino acids)-C-(4 to 12 amino acids)-H-(3 
amino acids)-H-(5 amino acids). A preferred example would be -FQCEEC-random peptide 
of 3 to 20 amino acids-HIRSHTG- (SEQ ID NO:12). - 
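
The general C2H2 consensus quoted above can be written as a regular expression. The following is a minimal sketch, assuming one-letter codes for the 20 standard amino acids and a match against the whole sequence; the names AA, C2H2_CONSENSUS and matches_c2h2 are illustrative and do not come from the specification.

```python
import re

# Any one of the 20 standard amino acids (one-letter codes).
AA = "[ACDEFGHIKLMNPQRSTVWY]"

# (5 aa)-C-(2 to 3 aa)-C-(4 to 12 aa)-H-(3 aa)-H-(5 aa)
C2H2_CONSENSUS = re.compile(
    f"{AA}{{5}}C{AA}{{2,3}}C{AA}{{4,12}}H{AA}{{3}}H{AA}{{5}}"
)


def matches_c2h2(sequence: str) -> bool:
    """Return True if the entire sequence fits the general consensus."""
    return C2H2_CONSENSUS.fullmatch(sequence) is not None
```

A scan for the motif inside a longer protein would use `search` or `finditer` in place of `fullmatch`.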
Please replace the paragraph beginning at page 14, line 13, with the following rewritten
paragraph: 




-- Similarly, CCHC boxes can be used (see Biochem. Biophys. Res. Commun. 242:385 (1998)),
that have a consensus sequence -C-(2 amino acids)-C-(4 to 20 random peptide)-H-(4 amino
acids)-C- (see Bavoso et al., Biochem. Biophys. Res. Comm. 242(2):385 (1998), hereby
incorporated by reference). Preferred examples include (1) -VKCFNC-4 to 20 random amino
acids-HTARNCR- (SEQ ID NO:13), based on the nucleocapsid protein P2; (2) a sequence
modified from that of the naturally occurring zinc-binding peptide of the Lasp-1 LIM domain
(Hammarstrom et al., Biochem. 35:12723 (1996)); and (3) -MNPNCARCG-4 to 20 random
amino acids-HKACF- (SEQ ID NO:14), based on the NMR structural ensemble 1ZFP
(Hammarstrom et al., Biochem. 35(39):12723 (1996)). --



Please replace the paragraph beginning at page 15, line 9, with the following rewritten
paragraph: 




— Suitable dimerization sequences will encompass a wide variety of sequences. Any number 
of protein-protein interaction sites are known. In addition, dimerization sequences may also 
be elucidated using standard methods such as the yeast two hybrid system, traditional 
biochemical affinity binding studies, or even using the present methods. See U.S.S.N.
60/080,444, filed April 2, 1998, hereby incorporated by reference in its entirety. Particularly
preferred dimerization peptide sequences include, but are not limited to, -EFLIVKS- (SEQ ID
NO:15), EEFLIVKKS- (SEQ ID NO:16), -FESIKLV- (SEQ ID NO:17), and -VSIKFEL- (SEQ
ID NO:18). --



Please replace the paragraph beginning at page 16, line 1, with the following rewritten
paragraph:

- In a preferred embodiment, the targeting sequence is a nuclear localization signal (NLS). 
NLSs are generally short, positively charged (basic) domains that serve to direct the entire 
protein in which they occur to the cell's nucleus. Numerous NLS amino acid sequences have 
been reported including single basic NLS's such as that of the SV40 (monkey virus) large T 
Antigen (Pro Lys Lys Lys Arg Lys Val (SEQ ID NO:19)), Kalderon (1984), et al., Cell, 39:499-509;
the human retinoic acid receptor-β nuclear localization signal (ARRRRP (SEQ ID NO:20));
NFkB p50 (EEVQRKRQKL (SEQ ID NO:21); Ghosh et al., Cell 62:1019 (1990)); NFkB p65
(EEKRKRTYE (SEQ ID NO:22); Nolan et al., Cell 64:961 (1991)); and others (see for example
Boulikas, J. Cell. Biochem. 55(1):32-58 (1994), hereby incorporated by reference) and double
basic NLS's exemplified by that of the Xenopus (African clawed toad) protein, nucleoplasmin
(Ala Val Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Leu Asp (SEQ ID
NO:23)), Dingwall, et al., Cell, 30:449-458, 1982 and Dingwall, et al., J. Cell Biol., 107:641-849,
1988. Numerous localization studies have demonstrated that NLSs incorporated in synthetic
peptides or grafted onto reporter proteins not normally targeted to the cell nucleus cause these
peptides and reporter proteins to be concentrated in the nucleus. See, for example, Dingwall
and Laskey, Ann. Rev. Cell Biol., 2:367-390, 1986; Bonnerot, et al., Proc. Natl. Acad. Sci. USA,
84:6795-6799, 1987; Galileo, et al., Proc. Natl. Acad. Sci. USA, 87:458-462, 1990. --

Please replace the paragraph beginning at page 17, line 12, with the following rewritten
paragraph: 




— Useful sequences include sequences from: 1) class I integral membrane proteins such as 
IL-2 receptor beta-chain (residues 1-26 are the signal sequence, 241-265 are the 
transmembrane residues; see Hatakeyama et al., Science 244:551 (1989) and von Heijne 
et al., Eur. J. Biochem. 174:671 (1988)) and insulin receptor β-chain (residues 1-27 are the
signal, 957-959 are the transmembrane domain and 960-1382 are the cytoplasmic domain;
see Hatakeyama, supra, and Ebina et al., Cell 40:747 (1985)); 2) class II integral membrane
proteins such as neutral endopeptidase (residues 29-51 are the transmembrane domain, 2-28
are the cytoplasmic domain; see Malfroy et al., Biochem. Biophys. Res. Commun. 144:59 
(1987)); 3) type III proteins such as human cytochrome P450 NF25 (Hatakeyama, supra); 
and 4) type IV proteins such as human P-glycoprotein (Hatakeyama, supra). Particularly 
preferred are CD8 and ICAM-2. For example, the signal sequences from CD8 and ICAM-2 
lie at the extreme 5' end of the transcript. These consist of the amino acids 1-32 in the case
of CD8 (MASPLTRFLSLNLLLLGESILGSGEAKPQAF (SEQ ID NO:24); Nakauchi et al., PNAS
USA 82:5126 (1985)) and 1-21 in the case of ICAM-2 (MSSFGYRTLTVALFTLICCPG (SEQ
ID NO:25); Staunton et al., Nature (London) 339:61 (1989)). These leader sequences deliver
the construct to the membrane while the hydrophobic transmembrane domains, placed 3' of 
the random peptide region, serve to anchor the construct in the membrane. These 
transmembrane domains are encompassed by amino acids 145-195 from CD8
(PQRPEDCRPRGSVKGTGLDFACDIYIWAPLAGICVALLLSLIITLICYHSR (SEQ ID NO:26);
Nakauchi, supra) and 224-256 from ICAM-2 (MVIIVTWSVLLSLFVTSVLLCFIFGQHLRQQR 
(SEQ ID NO:27); Staunton, supra). - 



Please replace the paragraph beginning at page 17, line 31, with the following rewritten
paragraph: 




- Alternatively, membrane anchoring sequences include the GPI anchor, which results in a 
covalent bond between the molecule and the lipid bilayer via a glycosyl-phosphatidylinositol 
bond for example in DAF (PNKGSGTTSGTTRLLSGHTCFTLTGLLGTLVTMGLLT (SEQ ID 
NO:28), with the bolded serine the site of the anchor; see Homans et al., Nature 333(6170):269-
72 (1988), and Moran et al., J. Biol. Chem. 266:1250 (1991)). In order to do this, the GPI
sequence from Thy-1 can be cassetted 3' of the variable region in place of a transmembrane 
sequence. - 



Please replace the paragraph beginning at page 18, line 3, with the following rewritten
paragraph: 



-- Similarly, myristylation sequences can serve as membrane anchoring sequences. It is known
that the myristylation of c-src recruits it to the plasma membrane. This is a simple and effective
method of membrane localization, given that the first 14 amino acids of the protein are solely
responsible for this function: MGSSKSKPKDPSQR (SEQ ID NO:29) (see Cross et al., Mol.
Cell. Biol. 4(9):1834 (1984); Spencer et al., Science 262:1019-1024 (1993), both of which are
hereby incorporated by reference). This motif has already been shown to be effective in the
localization of reporter genes and can be used to anchor the zeta chain of the TCR. This motif
is placed 5' of the variable region in order to localize the construct to the plasma membrane.
Other modifications such as palmitoylation can be used to anchor constructs in the plasma
membrane; for example, palmitoylation sequences from the G protein-coupled receptor kinase
GRK6 sequence (LLQRLFSRQDCCGNCSDSEEELPTRL (SEQ ID NO:30), with the bold
cysteines being palmitoylated; Stoffel et al., J. Biol. Chem. 269:27791 (1994)); from rhodopsin
(KQFRNCMLTSLCCGKNPLGD (SEQ ID NO:31); Barnstable et al., J. Mol. Neurosci. 5(3):207
(1994)); and the p21 H-ras 1 protein (LNPPDESGPGCMSCKCVLS (SEQ ID NO:32); Capon
et al., Nature 302:33 (1983)). --



Please replace the paragraph beginning at page 18, line 16, with the following rewritten 
paragraph: 




-- In a preferred embodiment, the targeting sequence is a lysosomal targeting sequence,
including, for example, a lysosomal degradation sequence such as Lamp-2 (KFERQ (SEQ
ID NO:33); Dice, Ann. N.Y. Acad. Sci. 674:58 (1992)); or lysosomal membrane sequences from
Lamp-1 (MLIPIAGFFALAGLVLIVLIAYL IGRKRSHAGYQTI (SEQ ID NO:34), Uthayakumar
et al., Cell. Mol. Biol. Res. 41:405 (1995)) or Lamp-2
(LVPIAVGAALAGVLILVLLAYFIG LKHHHAGYEQF (SEQ ID NO:35), Konecki et al., Biochem.
Biophys. Res. Comm. 205:1-5 (1994), both of which show the transmembrane domains in
italics and the cytoplasmic targeting signal underlined). --



Please replace the paragraph beginning at page 18, line 23, with the following rewritten
paragraph: 

-- Alternatively, the targeting sequence may be a mitochondrial localization sequence, including
mitochondrial matrix sequences (e.g. yeast alcohol dehydrogenase III;
MLRTSSLFTRRVQPSLFSRNILRLQST (SEQ ID NO:36); Schatz, Eur. J. Biochem. 165:1-6
(1987)); mitochondrial inner membrane sequences (yeast cytochrome c oxidase subunit IV;
MLSLRQSIRFFKPATRTLCSSRYLL (SEQ ID NO:37); Schatz, supra); mitochondrial
intermembrane space sequences (yeast cytochrome c1;
MFSMLSKRWAQRTLSKSFYSTATGAASKSGKLTQKLVTAGVAAAGITASTLLYADSLTAEAMTA
(SEQ ID NO:38); Schatz, supra) or mitochondrial outer membrane sequences (yeast 70 kD
outer membrane protein; MKSFITRNKTAILATVAATGTAIGAYYYYNQLQQQQQRGKK (SEQ
ID NO:39); Schatz, supra). --



Please replace the paragraph beginning at page 18, line 31, with the following rewritten
paragraph: 
(1992)) or adenovirus E3/19K protein (LYLSRRSFIDEKKMP (SEQ ID NO:41); Jackson et
al., EMBO J. 9:3153 (1990)). --

Please replace the paragraph beginning at page 19, line 1, with the following rewritten
paragraph: 




- Furthermore, targeting sequences also include peroxisome sequences (for example, the 
peroxisome matrix sequence from Luciferase; SKL; Keller et al., PNAS USA 4:3264 (1987));
farnesylation sequences (for example, P21 H-ras 1; LNPPDESGPGCMSCKCVLS (SEQ ID
NO:42), with the bold cysteine farnesylated; Capon, supra); geranylgeranylation sequences
(for example, protein rab-5A; LTEPTQPTRNQCCSN (SEQ ID NO:43), with the bold cysteines
geranylgeranylated; Farnsworth, PNAS USA 91:11963 (1994)); or destruction sequences (cyclin
B1; RTALGDIGN (SEQ ID NO:44); Klotzbucher et al., EMBO J. 1:3053 (1996)). --



Please replace the paragraph beginning at page 19, line 25, with the following rewritten
paragraph: 




- Suitable secretory sequences are known, including signals from IL-2 
(MYRMQLLSCIALSLALVTNS (SEQ ID NO:45); Villinger et al., J. Immunol. 155:3946 (1995)),
growth hormone (MATGSRTSLLLAFGLLCLPWLQEGSA FPT (SEQ ID NO:46); Roskam et
al., Nucleic Acids Res. 7:30 (1979)); preproinsulin (MALWMRLLPLLALLALWGPDPAAAFVN
(SEQ ID NO:47); Bell et al., Nature 284:26 (1980)); and influenza HA protein
(MKAKLLVLLYAFVAGDQI (SEQ ID NO:48); Sekiwawa et al., PNAS 80:3563)), with cleavage
between the non-underlined-underlined junction. A particularly preferred secretory signal
sequence is the signal leader sequence from the secreted cytokine IL-4, which comprises
the first 24 amino acids of IL-4 as follows: MGLTSQLLPPLFFLLACAGNFVHG (SEQ ID
NO:49). --







Please replace the paragraph beginning at page 19, line 33, with the following rewritten
paragraph: 



- In a preferred embodiment, the fusion partner is a rescue sequence. A rescue sequence 
is a sequence which may be used to purify or isolate either the peptide or the nucleic acid 
encoding it. Thus, for example, peptide rescue sequences include purification sequences 
such as the His6 (SEQ ID NO:50) tag for use with Ni affinity columns and epitope tags for
detection, immunoprecipitation or FACS (fluorescence-activated cell sorting). Suitable epitope
tags include myc (for use with the commercially available 9E10 antibody), the BSP biotinylation
target sequence of the bacterial enzyme BirA, flu tags, lacZ, GST, and Strep tag I and II. --



Please replace the paragraph beginning at page 20, line 9, with the following rewritten
paragraph: 



- In a preferred embodiment, the fusion partner is a stability sequence to confer stability to 
the peptide or the nucleic acid encoding it. Thus, for example, peptides may be stabilized 
by the incorporation of glycines after the initiation methionine (MG or MGG0), for protection 
of the peptide to ubiquitination as per Varshavsky's N-End Rule, thus conferring long half-life 
in the cytoplasm. Similarly, two prolines at the C-terminus impart peptides that are largely 
resistant to carboxypeptidase action. The presence of two glycines prior to the prolines both
imparts flexibility and prevents structure initiating events in the di-proline from being propagated
into the peptide structure. Thus, preferred stability sequences are as follows: MG(X)nGGPP (SEQ
ID NO:51), where X is any amino acid and n is an integer of at least four. --
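
As a minimal sketch of the MG(X)nGGPP layout described above, the helper below wraps a peptide in the N-terminal MG and C-terminal GGPP flanks and enforces n of at least four. It assumes one-letter codes for the 20 standard amino acids; the function and constant names are illustrative and not part of the specification.

```python
import re

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# MG, then at least four residues (X)n, then GGPP.
STABILITY_PATTERN = re.compile(f"MG[{AMINO_ACIDS}]{{4,}}GGPP")


def add_stability_flanks(peptide: str) -> str:
    """Return MG + peptide + GGPP, requiring at least four residues."""
    if len(peptide) < 4:
        raise ValueError("n must be an integer of at least four")
    if any(residue not in AMINO_ACIDS for residue in peptide):
        raise ValueError("peptide contains a non-standard residue code")
    return "MG" + peptide + "GGPP"


def has_stability_flanks(sequence: str) -> bool:
    """Check whether a full sequence has the MG(X)nGGPP form."""
    return STABILITY_PATTERN.fullmatch(sequence) is not None
```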



Please replace the paragraph beginning at page 20, line 25, with the following rewritten 
paragraph: 




-- In a preferred embodiment, the fusion partner includes a linker or tethering sequence. Linker
sequences between various targeting sequences (for example, membrane targeting sequences)
and the other components of the constructs (such as the randomized peptides) may be desirable
to allow the peptides to interact with potential targets unhindered. For example, useful linkers
include glycine polymers (G)n, glycine-serine polymers (including, for example, (GS)n, (GSGGS)n
(SEQ ID NO:52) and (GGGS)n (SEQ ID NO:53), where n is an integer of at least one), glycine-
alanine polymers, alanine-serine polymers, and other flexible linkers such as the tether for
the shaker potassium channel, and a large variety of other flexible linkers, as will be appreciated
by those in the art. Glycine and glycine-serine polymers are preferred since both of these
amino acids are relatively unstructured, and therefore may be able to serve as a neutral tether
between components. Glycine polymers are the most preferred as glycine accesses significantly
more phi-psi space than even alanine, and is much less restricted than residues with longer
side chains (see Scheraga, Rev. Computational Chem. III:73-142 (1992)). Secondly, serine
is hydrophilic and therefore able to solubilize what could be a globular glycine chain. Third,
similar chains have been shown to be effective in joining subunits of recombinant proteins 
such as single chain antibodies. - 
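
The repeat-unit linkers named above, (G)n, (GS)n, (GSGGS)n and (GGGS)n, can be generated with a short helper. This is only an illustrative sketch: the function names and the default unit and repeat count are assumptions, and the specification does not prescribe any particular n beyond "at least one".

```python
# Repeat units named in the paragraph above.
LINKER_UNITS = ("G", "GS", "GSGGS", "GGGS")


def make_linker(unit: str, n: int) -> str:
    """Build a linker of n repeats of one of the listed units."""
    if unit not in LINKER_UNITS:
        raise ValueError(f"unit must be one of {LINKER_UNITS}")
    if n < 1:
        raise ValueError("n must be an integer of at least one")
    return unit * n


def join_with_linker(left: str, right: str, unit: str = "GGGS", n: int = 3) -> str:
    """Place a linker between two components, e.g. a targeting sequence
    and a randomized peptide (default unit and n are arbitrary)."""
    return left + make_linker(unit, n) + right
```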



Please replace the paragraph beginning at page 22, line 9 , with the following rewritten 
paragraph: 




- The present invention has specifically contemplated each and every possible variation of 
polynucleotide that could be made by selecting combinations based on the possible codon 
choices, and all such variations are to be considered specifically disclosed and equivalent 
to the sequences of Figures 2 and 3 (SEQ ID NOS:1, 3-4). Codons are preferably selected
to fit the host cell in which the enzyme is being produced; that is, codon usage for yeast is 
used to express in yeast; codon usage for mammalian cells is used to express in mammalian 
cells; etc. Selection of codons to maximize expression of proteins in a heterologous host is 
a known technique. --
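
Codon selection of the kind described above can be sketched as a back-translation through a per-host table of preferred codons. The table below is deliberately partial and illustrative: its codons are the ones that appear in the coding portion of the R forward primer of this document (SEQ ID NO:62), and a working table would cover all 20 amino acids and be drawn from codon-usage data for the chosen host. The names PREFERRED_CODONS and back_translate are hypothetical.

```python
# Partial, illustrative table: one preferred codon per amino acid,
# taken from the coding portion of the R forward primer (SEQ ID NO:62).
PREFERRED_CODONS = {
    "M": "ATG", "G": "GGC", "S": "AGC", "K": "AAG", "Q": "CAG",
    "I": "ATC", "L": "CTG", "N": "AAC", "T": "ACC", "C": "TGC",
}


def back_translate(peptide: str, table: dict = PREFERRED_CODONS) -> str:
    """Encode a peptide using one preferred codon per residue."""
    missing = sorted({aa for aa in peptide if aa not in table})
    if missing:
        raise KeyError(f"no preferred codon listed for: {missing}")
    return "".join(table[aa] for aa in peptide)
```

With this table, back_translate("MGSKQILKNTCL") reproduces the 36 coding bases at the 3' end of SEQ ID NO:62. Swapping in a different table is all that is needed to target a different host.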

Please replace the paragraph beginning at page 54, line 3, with the following rewritten
paragraph:




- A number of cyclin destruction boxes are known in the art, for example, cyclin A has a 
destruction box comprising the sequence RTVLGVIGD (SEQ ID NO:54); the destruction 
box of cyclin B1 comprises the sequence RTALGDIGN (SEQ ID NO:55). See Glotzer et 
al., Nature 349:132-138 (1991). Other destruction boxes are known as well:
YMTVSIIDRFMQDSCVPKKMLQLVGVT (SEQ ID NO:56; rat cyclin B); 
KFRLLQETMYMTVSIIDRFMQNSCVPKK (SEQ ID NO:57; mouse cyclin B); 
RAILIDWLIQVQMKFRLLQETMYMTVS (SEQ ID NO:58; mouse cyclin B1); 
DRFLQAQLVCRKKLQWGITALLLASK (SEQ ID NO:59; mouse cyclin B2); and 
MSVLRGKLQLVGTAAMLL (SEQ ID NO:60; mouse cyclin A2). - 

Please replace the paragraph beginning at page 69, line 25, with the following 
rewritten paragraph: 




-- Retroviral constructs were based on a pCGFP vector that carries a composite CMV
promoter fused to the transcriptional start site of the MMLV R-U5 region of the LTR, an
extended packaging sequence, deletion of the MMLV gag start ATG, and a multiple
cloning region encoding human codon-optimized EGFP (Clontech, Palo Alto, CA) and a
Kozak consensus start, described in Kozak, Cell 44:283-292. The vector used to express
flag tagged EGFP, pEf, is identical to pCGFP but has additional restriction sites in the
open reading frame of EGFP (resulting in 8 non-human optimized codons) and a Flag tag
fused to the C-terminus of EGFP with the linker EEAAKA (SEQ ID NO:61). --

Please replace the paragraph beginning at page 70, line 3, with the following
rewritten paragraph:

-- pR and pP are retroviral expression vectors containing human codon-optimized Renilla
muelleri and Ptilosarcus gurneyi GFPs (containing 9 and 11 non-optimized codons,
respectively, to introduce restriction sites). Each has a Kozak consensus start and
backbone vector sequence identical to that of pCGFP and pEf. These vectors were made
by annealing and ligating 20 synthetic oligonucleotides (10 forward, 10 reverse for each
GFP gene) creating a dsDNA fragment for each sequence shown in Table 1. These
fragments were PCR amplified with respective primers:

R forward, 5' -

GATCATAGAATTCGCCACCATGGGCAGCAAGCAGATCCTGAAGAACACCTGCCTG
(SEQ ID NO:62);

P forward, 5' -

GATCATAGAATTCGCCACCATGGGCAACCGCAACGTGCTGAAGAACACCGGCCTG
(SEQ ID NO:63);

R and P reverse, 5' -

ATGATCGCGGCCGCTACACCCACTCGTGCAGGGATCCCAGGGGCTTGCCGATG
(SEQ ID NO:64);

and cloned into the EcoRI/NotI restriction sites of pEf (replacing the Ef coding region). C-
terminal Flag tags were added to these GFPs through BamHI/NotI sites using annealed
primers with sticky overhangs:

Forward, 5' -

GATCCCTGCACGAGTGGGTGGAGGAGGCCGCCAAGGCCGACTACAAGGACGACGA
CGACAAGTAGGCCCGTGAGGCCCTAAGC (SEQ ID NO:65);

Reverse, 5' -

GGCCGCTTAGGGCCTCACGGGCCTACTTGTCGTCGTCCTTGTAGTCGGCCTTGGCG 
GCCTCCTCCACCCACTCGTGCAGG (SEQ ID NO:66); 

creating Rf and Pf. pRcDNA was made by removing the R. muelleri cDNA gene from
pET-34 Native Renilla muelleri GFP (Prolume Ltd., Pittsburgh, PA) by PCR amplification
with primers: 

Forward, 5' - 

GATCATGAATTCGCCATGAGTAAACAAATATTGAAGAACACT (SEQ ID NO:67); 
Reverse, 5' -

TAGATCGCGGCCGCTTAAACCCATTCGTGTAAGGATCCTAGTGG (SEQ ID NO:68); 

and cloning into the EcoRI/NotI sites of pEf. Vectors containing codon optimized R.
muelleri GFP with a linker-HA tag-linker sequence inserted into each position A-F were 
created by the PCR sew technique of two fragments using primers shown above (R 
forward and R reverse). The two fragments for A-F were made by PCR amplification of 
the 5' section of R with respective primers: 

R forward, shown above; 

A reverse, 5' - 

CTGGCGTAGTCGGGCACGTCGTAGGGGTAGCCACCGCCCTGGCCCTCGTAGCGCA 
GGGTGCGCTCGTAC (SEQ ID NO:69); 

B reverse, 5' - 

CTGGCGTAGTCGGGCACGTCGTAGGGGTAGCCACCGCCCTGGCCCTCGATCAGGT 
TGATGTCGCTGCGG (SEQ ID NO:70); 

C reverse, 5' - 
CTGGCGTAGTCGGGCACGTCGTAGGGGTAGCCACCGCCCTGGCCGTTCATGTACA
TGGCCTCGAAGCTG (SEQ ID NO:71); 

D reverse, 5' - 

CTGGCGTAGTCGGGCACGTCGTAGGGGTAGCCACCGCCCTGGCCGTTAAGCTTGT 
ACACAGGATCACC (SEQ ID NO:72); 

E reverse, 5' - 

CTGGCGTAGTCGGGACGTCGTAGGGGTAGCCACCGAAATGGAAGAAATTGCTCTTC 
ATCAGGGTCTTC (SEQ ID NO:73); 

F reverse, 5' - 

CTGGCGTAGTCGGGCACGTCGTAGGGGTAGCCACCGCCCTGGCCGCCGCCGTCCT 
CCACGTAGGTCTTC (SEQ ID NO:74); 

and the 3' section of R with respective primers: 

A forward, 5' - 

CCTACGACGTGCCCGACTACGCCAGCCTGGGCCAGCAGGTGGAGGCGACGGCGG 
CCTGGTGGAGATCCGCA (SEQ ID NO:75);

B forward, 5' -

CCTACGACGTGCCCGACTAGCCAGCCTGGGCCAAGCAGGTGGAGGCGACAAGTTC 
GTGTACCGCGTGGAGT (SEQ ID NO:76); 

C forward, 5' - 

CCTACGACGTGCCCGACTACGCCAGCCTGGGCCAAGCAGGTGGAGGCAACGGCGT 
GCTGGTGGGCGAGGTGA (SEQ ID NO:77); 

D forward, 5' -

CCTACGACGTGCCCGACTACGCCAGCCTGGGCCAAGCAGGTGGAGGCAGCGGCAA 
GTACTACAGCTGCCACA (SEQ ID NO:78); 

E forward, 5' - 
CCTACGACGTGCCCGACTACGCCAGCCTGGGCCAAGCAGGTGGAGGCGTGGTGAA
GGAGTTCCCCAGCTACC (SEQ ID NO:79); 

F forward, 5' - 

CCTACGACGTGCCCGACTACGCCAGCCTGGGCCAAGCAGGTGGAGGCTTCGTGGA 
GCAGCACGAGACCGCCA (SEQ ID NO:80). The PCR sewed fragments were put into
the EcoRI/NotI sites of pEf. --
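
As a cross-check on the primer design described above, the Flag-tag forward primer (SEQ ID NO:65) can be translated with the standard genetic code. The sketch below assumes that the reading frame begins after the five-base GATCC overhang at the 5' end; under that assumption the primer encodes the last GFP residues, the EEAAKA linker (SEQ ID NO:61) and the Flag epitope DYKDDDDK before a stop codon. The helper names are illustrative.

```python
# Standard genetic code, built from the compact TCAG-ordered encoding.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Flag-tag forward primer as printed in this document (SEQ ID NO:65).
FLAG_FORWARD = (
    "GATCCCTGCACGAGTGGGTGGAGGAGGCCGCCAAGGCCGACTACAAGGACGACGA"
    "CGACAAGTAGGCCCGTGAGGCCCTAAGC"
)


def translate(dna: str, offset: int = 0) -> str:
    """Translate from `offset`, stopping at the first stop codon.

    A trailing partial codon is ignored.
    """
    protein = []
    for pos in range(offset, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends the reading frame
            break
        protein.append(residue)
    return "".join(protein)


# Reading frame assumed to start after the 5-base GATCC overhang.
flag_peptide = translate(FLAG_FORWARD, offset=5)
```

The same helper can be pointed at any of the other primers in this paragraph, provided the correct frame offset is supplied.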

Please replace the paragraph beginning at page 71, line 36, with the following 
rewritten paragraph:

- The bacterial expression vector for purification of Ptilosarcus GFP was created by PCR 
amplification of pP with primers:

forward, 5' - 

AGATCATAGATCTATGGGCAACCGCAACGTGCTGAAGAACACCGGCCTG (SEQ ID 
NO:81); 

P reverse, shown above. 

Digestion of the fragment with BglII/NotI and ligation into the BamHI/NotI restriction sites of
pGEX6P-1 (Pharmacia Biotech, Piscataway, New Jersey). The vector containing R. 
muelleri GFP with C10G and C35E mutations (observed to aid in the folding of the protein 
in bacteria) was created by PCR sewing together a fragment created by annealing and 
extending primers: 

forward, 5' - 

AGATCATAGATCTGAATTCATGGGCAGCAAGCAGATCCTGAAGAACACCGGCCTGC 
AGGAGGTGATGAGCTACAAGGTGACCTGGAGG (SEQ ID NO:82); 

reverse, 5' - 

GCCAACAGGATGTTGCCCTTGCCCTCGCCCTCCATGGTGAACACGTGGTTGTTAAC 
GATGCCCTCCAGGTTCACCTTGTAGCTCATCAC (SEQ ID NO:83); 

R reverse, shown above. 